GPU VSIPL: Core and Beyond
Authors
Abstract
GPU VSIPL is an implementation of the Vector Signal Image Processing Library (VSIPL) and provides a library-based solution for developing GPU-accelerated numerical applications. GPU VSIPL now supports much of the VSIPL Core Profile as well as element-wise matrix operators corresponding to the required vector operators. This is nearly a five-fold increase in function count since GPU VSIPL was first introduced and includes numerous high-level numerical procedures. GPU VSIPL is implemented with optimized kernels written in the CUDA programming language. It provides support for real and complex matrix and vector operations; signal processing operations such as FIR, FFT, correlation, and histogram; and linear algebra operations such as matrix product and QR decomposition. GPU VSIPL achieves performance comparable to CUBLAS without encumbering the developer with CUDA-specific function calls, and it is link-compatible with existing VSIPL applications.
Similar resources
GPU VSIPL: High-Performance VSIPL Implementation for GPUs
In this paper, we introduce GPU VSIPL, an implementation of the Vector Signal Image Processing Library (VSIPL) Core Lite profile developed for the graphics processing unit (GPU). This implementation achieves a speedup of one to two orders of magnitude over the reference implementation. Because VSIPL is an open standard for high-performance platform-independent applications, GPU VSIPL provides ...
Scalable SAR with Sourcery VSIPL++ for the Cell/B.E
Sourcery VSIPL++ for the Cell/B.E. [1][2] is an implementation of the open standard VSIPL++ signal and image-processing API [1] on the IBM Cell/B.E. multi-core processor architecture [4]. It is suitable for implementing high-performance signal-processing applications that take full advantage of the Cell/B.E. processor throughput, without sacrificing programmer productivity or application portab...
Optimization of Memory Allocation in VSIPL
In this paper, we propose an efficient memory allocation algorithm for the Vector, Signal, and Image Processing Library (VSIPL) standard. This algorithm improves the efficiency of key VSIPL functions by exploiting fundamental properties of VSIPL objects during the allocation of memory. We implemented and tested our algorithm using the VSIPL reference implementation and measured results on represent...
An Approach in Radiation Therapy Treatment Planning: A Fast, GPU-Based Monte Carlo Method
Introduction: An accurate and fast radiation dose calculation is essential for successful radiotherapy. The aim of this study was to implement a new graphics processing unit (GPU) based radiation therapy treatment planning method for accurate and fast dose calculation in radiotherapy centers. Materials and Methods: A program was written for parallel runnin...
VSIPL++Pro – A High-Performance VSIPL++ Implementation
VSIPL++ achieves high performance through both the efficiency of C++ as a systems language and the use of C++ features for high-performance generic programming to convert domain-level abstractions into efficient code at compile time. The VSIPL++ API allows for early binding of computation and communication, allowing setup for complex functions and communications to be done early, out of critical...
Publication date: 2009